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1. Introduction 


Let Xi,Xn be n > 1 independent copies of a non-negative random variable (rv) X, defined 
over some probability space (ff, A, P) , with a cumulative distribution function (cdf) F. These 
rv’s are censored to the right by a sequence of independent copies hi,of a non-negative 
rv y, independent of X and having a cdf G. At each stage 1 < j < n, we can only observe the 
rv’s Zj := mm{Xj,Yj) and 6j := l{Xj < Yj} , with 1 {■} denoting the indicator function. 
The latter rv indicates whether there has been censorship or not. If we denote by H the cdf of 
the observed Z's, then, by the independence of X and Y, we have 1 — H = {1 — F) {1 — G). 
Throughout the paper, we will use the notation S{x) := S(oo) — S(x), for any S. Assume 
further that F and G are heavy-tailed or, in other words, that F and G are regularly varying 
at infinity with negative indices — 1/71 and — 1/72 respectively. That is 


lim and lim ( 1 . 1 ) 

p G (t) 

for any x > 0. This class of distributions includes models such as Pareto, Burr, Frechet, 
a—stable (0 < a < 2 ) and log-gamma, known to be very appropriate for fitting large insur¬ 
ance claims, large fluctuations of prices, hnancial log-returns, ... (see, e.g.. Resnick, 2006). 
The regular variation of F and G implies that H is regularly varying as well, with index 
— 1/7 where 7 := 7172 / (71 + 72 ) • Since weak approximations of extreme value theory based 
statistics are achieved in the second-order framework (see de Haan and Stadtmiiller, 1996), 
then it seems quite natural to suppose that cdf F satishes the well-known second-order 
condition of regular variation. That is, we assume that for any x > 0 


Up (tx)/U f (t) — x'^^ 
hm-^——- 

t^OO Aj (f) 


= x^i 



( 1 . 2 ) 


where r < 0 is the second-order parameter and is a function tending to 0 , not changing 
sign near inhnity and having a regularly varying absolute value at inhnity with index r. If r = 
0, interpret (x”" — 1) /r as logx. In the sequel, the functions (s) := inf {x : K (x) > s} , 
0 < s < 1 , and Lx (t) := {1 — l/t), t > 1, respectively stand for the quantile and 

tail quantile functions of any given cdf K. The analysis of extreme values of randomly 
censored data, is a new research topic to which Reiss and Thomas (2007) made a very brief 
reference, in Section 6.1, as a hrst step but with no asymptotic results. Beirlant et al. (2007) 
proposed estimators for the extreme value index (EVI) 71 and high quantiles and discussed 
their asymptotic properties, when the data are censored by a deterministic threshold. For 
their part, Einmahl et al. (2008) adapted various classical EVI estimators to the case where 
data are censored, by a random threshold, and proposed a unihed method to establish their 
asymptotic normality. Their approach is used by Ndao et al. (2014, 2016) to address the 
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nonparametric estimation of the conditional EVI and large quantiles. Based on Kaplan- 
Meier integration, Worms and Worms (2014) introduced two new estimators and proved 
their consistency. They showed, by simulation, that they perform better, in terms of bias 
and mean squared error (MSE) than the adapted Hill estimator of Einmahl et al. (2008), in 
the case where the tail index of the censored distribution is less than that of the censoring 
one. Brahimi et al. (2015) used the empirical process theory to approximate the adapted Hill 
estimator in terms of Gaussian processes, then they derived its asymptotic normality only 
under the usual second-order condition of regular variation. Their approach allows to relax 
the assumptions, made in Einmahl et al. (2008), on the heavy-tailed distribution functions 
and the sample fraction of upper order statistics used in estimate computation. Recently, 
Beirlant et al. (2016) developed improved estimators for the EVI and tail probabilities by 
reducing their biases which can be quite substantial. In this paper, we develop a new 
methodology, for the estimation of the tail index under random censorship, by considering 
the nonparametric estimator of cdf F, based on Nelson-Aalen estimator (Nelson, 1972; Aalen, 
1976) of the cumulative hazard function 

Jo F [v) Jo H [v) 

where (z) := P (Zi < z, 6i = 1) = G (y) dF (y) , > 0. A natural nonparametric 

estimator A„ (z) of A (z) is obtained by replacing FI (v) and (u) by their respective 
empirical counterparts Hn (v) := n~^ — '^) (v) := n~^ — '^) 

pertaining to the observed Z-sample. However, since (^n-.n) = 0, we use Hn (v—) := 
limu'i-t, Hn (u) instead of Hn (v) (see, e.g., Shorack and Wellner, 1986, page 295) to get 


A„, (z) := = 


( 1 ) 


Hn (V-) 


{Zi<z) 

> - , , , , for Z < Zn-.n- 


2=1 


n — i + 1 


From the dehnition of A (z ), we deduce that F (z) = exp {—A (z)} , which by substituting 
An {z) for A [z ), yields Nelson-Aalen estimator of cdf F, given by 


q"-"’ (z) := 1 


n 


n 


i + 1 


, for Z < Zn:n, 


where Zi-n < ... < Zn-n denote the order statistics, pertaining to the sample (Zi ,..., Zn) and 
(5[i:„], ...,d[n:n] the associated concomitants so that = di if Zj^n = Zi. Note that for our 
needs and in the spirit of what Efron (1967) did to Kaplan-Meier estimator (Kaplan and 
Meier, 1958) (given in (3.7)), we complete Fn^^'^ beyond the largest observation by 1. In 
other words, we dehne a nonparametric estimator of cdf F by 


Fn iz) := 


{z) for 2 ; < Zn-.n, 
1 for 2 : > Zn,n- 


1 
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By considering samples of various sizes, Fleming and Harrington (1984) numerically com¬ 
pared Fn with Kaplan-Meier (nonparametric maximum likelihood) estimator of F (Kaplan 
and Meier, 1958), given in (3.7) , and pointed out that they are asymptotically equivalent 
and usually quite close to each other. A nice discussion on the tight relationship between 
the two estimators may be found in Huang and Strawderman (2006). 


In the spirit of the tail product-limit process for randomly right-truncated data, recently 
introduced by Benchaira et al. (2016), we dehne Nelson-Aalen tail product-limit process by 


D, 


_ ^ / F^ {xZ^_k:n) _ 

\Fn {Zr,-k:n) J 


(1.3) 


where k = kn is an integer sequence satisfying suitable assumptions. In the case of complete 
data, the process Dn {x) reduces to (x) : = Vk {xXn.k-,n) — X j , a: > 0, where 

F„ (x) := n~^ 1 usual empirical cdf based on the fully observed sample 

(Al, ...,Xn) ■ Combining Theorems 2.4.8 and 5.1.4 in de Haan and Ferreira (2006), we infer 
that under the second-order condition of regular variation (1.2) , there exists a sequence of 
standard Weiner processes {Wn (s) ; 0 < s < 1} such that, for any xq > 0 and 0 < e < 1/2, 

_ X 


sup X 

X>Xq 


e/71 


(x) - Jn (x) - X 


X 


^/7i 


-\/kAi (n/k) 


—)■ 0 , as n —)■ cx). 


(1.4) 


where Ai (t) := Al [l/F (t)) and (x) := W„ (1). One of the main 

applications of the this weak approximation is the asymptotic normality of tail indices. 
Indeed, let us consider Hill’s estimator (Hill, 1975) 

k 

:= k-^ iXn-^+l■.n/Xn-k■.n) , 

i=l 

which may be represented, as a functional of the process D„ (x), into 

POO 

- 7 i j = y x”^D„ (x) dx. 

It follows, in view of approximation (1.4) , that 


y/k 





X ^Jn (x) dx + 


\/kAi [n/k) 

1 — T 


+ Op (1), 


leading to ^/k (n/k) X < 00 . For 

more details on this matter, see for instance, de Haan and Ferreira (2006) page 76. 


The major goal of this paper is to provide an analogous result to (1.4) in the random censor¬ 
ing setting through the tail product-limit process (x) and propose a new asymptotically 
normal estimator of the tail index. To the best of our knowledge, this approach has not 
been considered yet in the extreme value theory literature. Our methodology is based on the 
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uniform empirical process theory (Shorack and Wellner, 1986) and the related weak approxi¬ 
mations (Csorgd et al., 1986). Our main result, given in Section 2 consists in the asymptotic 
representation of Dn {x) in terms of Weiner processes. As an application, we introduce, in 
Section 3, a Hill-type estimator for the tail index 71 based on The asymptotic normality 
of the newly proposed estimator is established, in the same section, by means of the afore¬ 
mentioned Gaussian approximation of Dn {x) and its finite sample behavior is checked by 
simulation in Section 4. The proofs are postponed to Section 5 and some results that are 
instrumental to our needs are gathered in the Appendix. 


2. Main result 


Theorem 2.1. Let F and G be two cdf’s with regularly varying tails (1-1) and assume that 
the second-order condition of regular variation (1.2) holds with 71 < 72 . Let k = kn be an 
integer seguence such that — )■ 00, k/n —)■ 0 and \/~kAi (h) = 0 ( 1 ), where h = hn '■= 
Ur {n/k). Then, there exists a seguence of standard Weiner processes {Wn (s) ; 0 < s < 1 } 
defined on the probability space (r2,.4,, P), such that for every 0 < e < 1/2, we have, as 
n —)■ 00, 


sup 

x>p'y 


r 




4o, 


(2.5) 


where Jn (x) 


Jin (x) + J 2 n (x) , with 

(U := yi (1) } . 


and 

J 2 n (x) := {p^n ,2 " 9^n,l } ^U, 

where and are two independent ITemer processes defined, for 0 < s < 1, by 


Wn,i is) := {9) -Wn{0- ps)} l{9-ps>0) and W„,2 (s) := Wn (1) - IHn (1 - gs) , 

with 9 := (00) , p = 1 — q := 7/71. 


Remark 2.1. It is noteworthy that the assumption 71 < 72 is required to ensure that enough 
extreme data is available for the inference to be accurate. In other words, the proportion p of 
the observed extreme values has to be greater than 1/2. This assumption is already considered 
by Worms and Worms (201 f) and, in the random truncation context, by Gardes and Stupfier 
(2015) and Benchaira et al. (2016). 


Remark 2.2. In the complete data case, we use F„ instead of Fn and we have p = 1 = 9. 
This implies that q = 0, H = F, Urih) = n/k, Jin{x) = W„ (1), 

J 2 n (x) = 0 and W„ (s) = Wn (1) — Wn (1 — s). Since 

{hhn ( 5 ), 0 < s < 1} = {WJI) - (1 - s), 0 < s < 1} , 
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it follows that Jin (x) = Wn (x — X ^^^^Wn (1) and so approximations (2.5) and (1.4) 


V 


agree for xq = p^. The symbol = stands for equality in distribution. 


3. Tail index estimation 

In the last decade, some authors began to be attracted by the estimation of the EVI 71 when 
the data are subject to random censoring. For instance, Einmahl et al. (2008) adapted the 
classical Hill estimator (amongst others) to a censored sample to introduce ;= fp 

as an asymptotically normal estimator for 71 , where 7 := k-^ Ell log {Zn-i+l:n/Zn-k-.n) , 

is Hill’s estimator of 7 based on the complete sample Zi, ..., Zn and p := k~^ E^=i ^[n-i+i-.n]- 
By using the empirical process theory tools and only by assuming the second-order condition 
of regular variation of the tails of F and G, Brahimi et al. (2015) derived, in Theorem 2.1 
(assertion (2.9)), a useful weak approximation to in terms of a sequence of Brownian 

bridges. They deduced in Corollary 2.1 that 

A 


'/k 


7 i 


(EFG) 


7i j (^Z 


7? 


as n —)■ cx). 


(3.6) 


1 — pr p 

provided that \/kAi (h) —>■ A. For their part. Worms and Worms (2014) proposed two es¬ 
timators which incidentally can be derived, through a slight modihcation, from the one we 
will dehne later on. They proved their consistency (but not the asymptotic normality) by 
using similar assumptions as those of Einmahl et al. (2008). These estimators are dehned by 

-1 k 


7i -777W- WZ^: --rlog 


{Zn-k:n) 


1 f^{KM) / y _N 

i=l J- Grn \^n—i+l:n ) 


n—i+l:n 

Zn—k:n 


and 


^{W2) 

7i := 


n (1 - Fi^^^ {Zn-k:n) 
where, for z < Zn-.m 


E 

i=l 


'^n \'^n—i+l:n~ 


-i log 


Zn- 


7 ‘ 


p{KM) 


:= 1 - 


n 


n — % 
n — i + 1 


and (^) ;= 1 - JJ 


i:Zi.ri<z 


n — t 
n — i + 1 


l-5u 


(3.7) 

are Kaplan-Meier estimators of F and G respectively. Thereafter, we will see that the 
assumptions under which we establish the asymptotic normality of our estimator are lighter 
and more familiar in the extreme value context. We start the dehnition of our estimator by 
noting that, from Theorem 1.2.2 in de Haan and Ferreira (2006), the hrst order condition 
of regular variation (1.1) implies that x~^F{tx) jF {t) dt = 71 ,which, after an 

integration by parts, may be rewritten into 

1 


lim — , 

t^oo p (t) Z 


log (j) dF{x) = 7 i. 


(3.8) 
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By replacing F by and letting t = Zn-k-.n, we obtain 

1 r 


7i = ^ 


log {X I Zn-k-.n) dFn (x) , 


(3.9) 


Fn {Zn-k:n) JZ„-k 

as an estimator of 71 . Before we proceed with the construction of 71 , we need to dehne a 
function that is somewhat similar to ^nd its empirical version by (z) := 

P (Zi < 2 ;, (5i = 0) = F {y) dG {y) and {z) := n~^ Ya=i (1 “ {Zi < z), z > 0, 


respectively. Now, note that dFn (z) = exp <j dHn^ (v) /Hn (x—) \ dHn’ (z ), then we have 


r(i) 


Zi-/n, I X ^ 


n 




7i = 


i=n—k+l 


n—k\n 


^n-j + l 


n—k 

JJexp 

t=i 




[j-n] 


n — j + 1 


which may be rewritten into 




n ^ ‘ ^ in — 7 + 1, 

i=n—k-\-l j=l ^ ^ j=l 


n 


exp 


1 jQg 


n — j + 1 


Zn- 


n—k:n 


n—k 

JJexp 

i=i 


>[j-.n] 


n — j + 1 


which in turn simplihes to 


1 

n 


n i f 1 1 ^ 

=n—/c+1 7=1 ^ '' j=n—k-\-l 


Ir-n] [ Zj-.n 


n — j + 1 


•^n—k\n 


i=n—k+l j=l ^ '' j=n—k+l 

By changing i into n — i + 1 and j into n — j + 1, with the necessary modihcations, we end 
up with the following explicit formula for our new estimator of the EVI 71 : 

k 

ii = Y 


Zn— 


i=l 


n—2+l:n 

7 ' 

^n—k:n 


where 


^i.n • ^ ^\n— 


*^i,n 




JJexp {1/i} P[exp 


(3.10) 


J=l 


J=l 


Note that for uncensored data, we have X = Z and all the 6's are equal to 1, therefore 




k for 7 = 1,..., k. Indeed, 

1 A rn 1 A r, a n + i i i 

+,n = “ 11 exp < - ^ 1 + - [> = — 7 — 77 - ~ y, as n -)■ CX), 


n 171 n I 7 

j=k+l j=k+l ^ 


n k + 1 k 


which leads to the formula of the famous Hill estimator of the EVI (Hill, 1975). The consis¬ 
tency and asymptotic normality of 71 are established in the following theorem. 

















Theorem 3.1. Let F and G be two cdf’s with regularly varying tails (LI) such that 71 < 72 . 
Let k = kn be an integer seguence such that k ^ 00 and k/n ^ 0, then 71 —)• 71 in probability, 
as n ^ 00 . Assume further that the second-order condition of regular variation (1.2) holds, 
then for all large n 





+ 



ds — 7 i 




+ Op ( 1 ), 


where and o,re those defined in Theorem 2.1. If in addition y/kAi (h) —)■ A, then 
'/k (71 - 7i) 4 AT 2 p - 1 ^^) ’ (3-11) 


Remark 3.1. We clearly see that, when there is no censoring {i.e. p = 1 — q = 1), the 
Gaussian approximation and the limiting distribution above perfectly agree with those of 
Hill’s estimator (see, e.g., de Haan and Ferreira, 2006, page 16). 


Remark 3.2. Since 0 < p < 1 and r < 0, then 1 — t> 1— pr>0. This implies that the 
absolute value of the asymptotic bias 0/71 is smaller than or equal to that 0 / 7 !^^^^ (see 
(3.11) and (3.6)). In other words, the new estimator is of reduced bias compared to 
However, for any 1/2 < p < I, we have p/ (2p — 1) > 1/p meaning that the asymptotic 
variance 0/71 is greater than that ofj[^^'^\ This seems logical, because it is rare to reduce 
the asymptotic bias of an estimator without increasing its asymptotic variance. This is the 
price to pay, see for instance Peng (1998) and Beirlant et al. (2016). We also note that in 
the complete data case, both asymptotic biases coincide with that of Hill’s estimator and so 
do the variances. 


4. Simulation study 

We mentioned in the introduction that Worms and Worms (2014) showed, by simulation, 
that their estimators outperform the adapted Hill estimator introduced by Einmahl et al. 
(2008). Therefore, this study is intended for comparing our estimator 71 to 7 |^^^and 
with respect to biases and MSB’s. It is carried out through two sets of censored and censoring 
data, both drawn from the following Burr models: F (x) = (l + and G{x) — 

(1 + , x > 0, where pi, 72 ,7i, I 2 > 0. We hx pi = 72 = 1/4 and choose the values 

0.2 and 0.8 for 71 . For the proportion of really observed extreme values, we take p = 0.55, 
0.70 and 0.90 for strong, moderate and weak censoring respectively. For each couple ( 71 , p), 
we solve the equation p = 72/(71 + 72 ) to get the pertaining 72 -value. We generate 2000 
independent replicates of size n = 300 then n = 1000 from both samples (Xi, ...,X„) and 
(Yi, ...,Yn). Our overall results are taken as the empirical means of the results obtained 
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Figure 4.1. Bias (left panel) and mse (right panel) of the estimators 71 (solid 
line), (dashed line) and 7 ^^^^ (dotted line) of the tail index 71 = 0.2 

(top) and 7 i = 0.8 (bottom) of strongly right-censored Bnrr model, based on 
2000 samples of size 300. 

throngh all repetitions. We plot the absolute biases and the MSB’s as functions of the 
number k of upper order statistics used in the computation of the three estimators. The 
simulation results are illustrated, for the respective p-values, in Figures 4.1, 4.2 and 4.3 for 
n = 300 and in Figures 4.4, 4.5 and 4.6 for n = 1000. On the light of all the hgures, it is 
clear that our estimator performs better than the other two. Indeed, in (almost) each case, 
the minima of the absolute bias and MSE of 71 are less than those of 7 |'^^^and 7 ^'^^^ In 
addition, our estimator reaches its minima a long way before yj^^^and do, meaning 

that the number k of extremes needed for the last two estimators to be accurate is much 
larger than those needed for 71 . In other words, the cost of our estimator in terms of upper 
order statistics is very low compared to that of Worms and Worms estimators. 

5. Proofs 

In the sequel, we will use the following two empirical processes y/n ( (z) — H (^) j , 
j = 0,1] z > 0 , which may be represented, almost surely, by two uniform empirical ones. 
Indeed, let us consider the independent and identically distributed (iid) (0, l)-uniform rv’s 
Ui := (Zi) -I- (1 — Si) (6* -I- (Z)) , * = 1, defined in (see Einmahl and Koning, 

1992). The empirical cdf and the uniform empirical process based upon these rv’s are 
respectively denoted by 

Un (s) := 7)^ {i : 1 < i < n, Ui < s} /n and (s) := y/n (Un, (s) — s), 0 < s < 1. (5.12) 


Deheuvels and Einmahl (1996) state that almost surely (a.s.) 

H'i’ (z) = U„ (.-)) and (z) = U„ (a) + 9) - U„ (9), 


(5.13) 
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Figure 4.2. Bias (left panel) and mse (right panel) of the estimators 71 (solid 
line), (dashed line) and 7 ^^^^ (dotted line) of the tail index 71 = 0.2 

(top) and 7 i = 0.8 (bottom) of moderately right-censored Bnrr model, based 
on 2000 samples of size 300. 






Figure 4.3. Bias (left panel) and mse (right panel) of the estimators 71 (solid 
line), 7 !^^^ (dashed line) and 7 !^^^ (dotted line) of the tail index 71 = 0.2 
(top) and 7 i = 0.8 (bottom) of weakly right-censored Burr model, based on 
2000 samples of size 300. 

for 0 < (z) < 9 and 0 < (z) < 1 — 9. It is easy to verify that a.s. 

^ (z) - (z)) = {9) -an{9- (z)) , for 0 < (z) < 9, (5.14) 

and 

^/n {z) — (z) j = —an ( 2 ) j , for 0 < (^) < 1 — 9. (5.15) 

Our methodology strongly relies on the well-known Gaussian approximation, given by Csorgd 
et al. (1986) in Corollary 2.1. It says that on the probability space (O,^, P) , there exists a 
sequence of Brownian bridges {Bn (s); 0 < s < 1} such that, for every 1/4 < rj < 1/2, we 
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Figure 4.4. Bias (left panel) and mse (right panel) of the estimators 71 (solid 
line), (dashed line) and 7 ^^^^ (dotted line) of the tail index 71 = 0.2 

(top) and 7 i = 0.8 (bottom) of strongly right-censored Bnrr model, based on 
2000 samples of size 1000 . 






Figure 4.5. Bias (left panel) and mse (right panel) of the estimators 71 (solid 
line), 7 !^^^ (dashed line) and 7 !^^^^ (dotted line) of the tail index 71 = 0.2 
(top) and 7 i = 0.8 (bottom) of moderately right-censored Burr model, based 
on 2000 samples of size 1000 . 


have 


I ("S) Byi (s) I 
sup —rn -^— 


Op . 


(5.16) 


For the increments ( 6 *) — (0 — s), we will need an approximation of the same type 

as (5.16) . Following similar arguments, mutatis mutandis, as those used in the proofs of 
assertions (2.2) of Theorem 2.1 and (2.8) of Theorem 2.2 in Csorgd et al. (1986), we may 
show that, for every 0 < 6 * < 1 , we have 


sup 

l/n<s<6 


|{an { 0 ) 




Op . 


(5.17) 
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Figure 4.6. Bias (left panel) and mse (right panel) of the estimators 71 (solid 
line), (dashed line) and 7^^^^ (dotted line) of the tail index 71 = 0.2 

(top) and 7i = 0.8 (bottom) of weakly right-censored Bnrr model, based on 
2000 samples of size 1000. 


5.1. Proof of Theorem 2.1. The proof is carried ont throngh two steps. First, we asymp¬ 
totically represent Dn {x) is terms of the empirical cdf Hn and the empirical snb-distribntions 
fnnctions Hn'^ and Hn^. Second, we rewrite it as a fnnctional of the following two processes 

(w) := jon (6) -an(o - {wZn-k-.n)^ | , for 0 < {wZn-k-.n) < 6, (5.18) 


and 


T7(0) 


T7(0) 


(in) := -y (l-W' (wZ^-k:n)) , for (wZ^.^-.n) <1-0, (5.19) 

in order to apply the weak approximations (5.16) and (5.17) . We begin, as in the proof of 
Theorem 2.1 in Benchaira et al. (2016), by decomposing (x) into the snm of 

Fn {xZn—k:n) F (^xZn—k:n) 


^nl 


(x) : = 


F {Zn-k-n) , 

'-2 1^1 := 1 -T - 1 

^ n \^n—k:n) 


F {Zn-k:n) 

Fn {F^n—k:n) P {_^^n—k:7i) 

F (Zn-k-.n) 


^ / \ F (^xZfi—k:n) Fn (^Zn—kin) F (^Zfi—k:n) 

^n3 \^) • 


F {Zn-k-.n) 


Fn (^Zji—k:n) 


and 


.„4 (X) := 


F (Zn-k-.n) F (h) 

In the following two snbsections, we show that, nniformly on a: > p'^, for any 1/4 < p < p/2 
and small 0 < Cq < 1, we have 

3 

VkMn. (X) - Jn (X) = Op , 


2=1 


(5.20) 
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where Jn {x) is the Gaussian process dehned in Theorem 2.1. It is worth mentioning that 
both M„2 {x) and M„3 (x) may be rewritten in terms of M„i (x). Indeed, it is readily checked 
that 


M„2 (ai) 


/ F (Zn-k-.n) 
\Fn ^^n—k:n) 


M„i (x) and M„3 (x) 


F (^XZn—k:n) 
Fn i^^n—k'.n) 


M„i (1) . 


Then, we only focus on the weak approximation y/kWlni (x) which will lead to that of 
V^M„3 (x) and the asymptotic negligibility (in probability) of '/k'Mn 2 (x) . Finally, the ap¬ 
proximation of \/~kMnA {x) will result in the asymptotic bias. In conclusion, we might say 
that representing (x) amounts to representing M„i (x). 


5.1.1. Representation o/M„i (x) in terms of Hm Hn'^ and Hn'^. We show that, for 
all large n and x>p^ 


M„i (x) = T„i (x) -f T„2 (a:) + T„3 (2:) + Rn (x), 


(5.21) 


where 


T„i (x) : = 


F (Zn-k-.n) 


Tn2 (a:) : = 


Tn3 (a:) : = 



d (W) - hfd) (y;)j 


H(w 


d (v) - ] 

— - (w ), 

H (n) 


JxZ„-k:n F (Zn-k: 

r _F(;.) ( r 1 (,„), 

I xZ„_k:n F (Zn-k-.n) [Jo H (v) ) 


and Rn(x) := Op for every 1/4 < rj < p/2 and small 0 < Cq < 1. 

For notational simplicity, we write 6^’^° := max (6'^°, 6“'^°) and without loss of generality, 
we attribute Cq to any constant times Cq and to any linear combinations of and 

^±£260^ for every Ci, C2 > 0. We begin by letting ^ (w) := F (w) /H (w) which may be rewritten 
into exp | dH^^^ (v) /H (n)} whose empirical counterpart is equal to 


(w) := exp 


(!.) /ff„ (!.) \ . 


Observe that F (xZn-k.n) = ^ ^ (w) dFl^^ (w) , then by replacing I and Fl'd) by ^n and 

Hn'^ respectively, we obtain Fn (xZn-k.n) = in (w) dH^p (w). Thus 

o XZjy^ — h\n. 

^ i(w 


Mni (a;) = [ ,^(i) ' 

JxZ„-k-.n F (Zn-k-.n) JxZ„-k-.„ F (Zn-k.n) 

By applying Taylor’s expansion, we may rewrite in (w) — i (w) into 
rdHiJ'>(v) ^dH^F^y)^ 1-. r r^^zT(o) 


dH^^^ (w ). 


(5.22) 


i(w) 


'0 


Hn (V) 


'0 


H(v) 


+ \in(W) 


^ dH//> (n) r dH^^^ (n) 
~ Jo H(v) 


'0 -n n 


Hn (V) 


2 
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where {w) is a stochastic intermediate value lying between i (w) and in (w). This allows 
us to decompose M„i (x), in (5.22) , into the sum of 

= i{w) 


T;, (a:) := 


^ (^n—k:n) 


d (Hi'> (w) - ff'‘> («.)) , 


T;2 (a:) := 


T;3 (at) a= 


e(w) ( f- d {h!°Hv) - HV>Hv)) 


H(.v) 


Rnl (^) • 


JxZn-k:n F {Zn-k:n) I Jo 

r Ffsi/' 

'xZn-k-.n ^ [^n-k:n) \ JO 

_i{w) 

^^n—k:7i F{Zn-k:n) [A Hn{v)H\v) 


dH!;i (w) , 


"(-2 ""(''( rfgf (<>) j dHW («>), 

H (v) J 


and 


Rn2 (x) := - 


in {w) 




} iH'n'' (») 


'^JkZ„-t„F{Zn-k,n) yjo Hn(v) Jo H (v) J 

First, we clearly see that T* (x) = T„i (x). Next, we show that T *2 (a)) and ¥*3 (x) are 
approximations of Tn 2 (x) and ¥^3 (x) respectively, while Rni (x) = Op 2 :( 2 » 7 -p)/ 7 ±eo^ 

z = 1, 2, as n —)■ cxD, for any x > p'*'. Since, from representation (5.13) , we have H\ {w) = 
U„ (w)j a.s., then without loss of generality, we may write 

d {v) - FF(o) (w)) 


Tn2 (a:) = 


/■“ i (w) 

^ {Zn—k:n) I 20 


g(!>) 


(RJn (w)'^ . 


Let (u) := inf |tc : (w) > m| , 0 < m < 6 *, and set t = Vn (tc) j or, in other 

words, w = (Yn (t)) , where denotes the empirical quantile function pertaining to 
U„. By using this change of variables, we get 




¥* — 
^n2 — 


{xz„_,.,n) £ ^Yn (t)) ( d (n) - FF^°^ (n)) 


F (yZyi—k:n) I Jo 


H(v) 


dt, 


where, for notational simplicity, we set C (s) := i (s)) . Now, we decompose ¥*2 (x) into 
the sum of 




J^ni (a)) . 


Oz„_fe.„) £ ^Yn (t)) d (n) - FF^°^ (n)) 


I H^^\xZ„_k:n) F{Zn-k,n)Jo 


H{v) 


dt. 




An 2 (x) : = 


{xZn-k-.n) £ _ £ (^Yj /•Q''^HVn(b) d ^FF^^^ (n) — FF*'^^ (n) j 


F {^Zn—k'.n) Jo 


H(v) 


dt, 




An’S, (ao) . 


£ (i) rQ^^H^Rt)) d (FFi°^ (n) - FF^°^ (n)) 


F ^Z^i—k-.n) JQ^^^{t) 


H(v) 


dt, 
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and 




(1) 


Ani (x) : = 


^ |^Q(^Ht) d [nf (n) - (n)) 


dt. 


Jo F {Zn-k:n) Jo H (w) 

It is clear that the change of variables w = (t) yields that An 4 (x) = T„2 (x). Hence, 

one has to show that Ani (x) = Op i = 1, 2, 3, as n —)■ oo, nniformly on 

X > p'^. We begin by Ani (x) for which an integration by parts gives 


f* ^ 


-^nl • 


(t)) d (v) - (n)) 


' H 


^^\xZn-k:n.) F [Zn-k:n) Jo 


H(v) 


dt, 


.77(1)/ 


^ (Y„ (i)) r Hf (Qd) (V„ (f))) - H^'^’ (Qd) (V„ {t))) 


T7(0) 


^rj.T 


fH 




H(Q<» (V„(«))) 


Jo H(v) ‘ 

If In {x) denotes the interval of endpoints {xZn-k-.n) and {xZn-k-.n), then from 
Lemma (6.1), we have snp^gj^^^,) (£ (f)/£ (V„ (f))) = Op (1), nniformly on a: > p'^. On 
the other hand, form representation (5.13) , we infer that 

|:h® (t), 0 < s < 1 - 0 } = |u„ (t)), 0 < s < 1 - 0}. 

Then by using assertion (ii) in Proposition 6.1, we have that 

-M+(x) 


Ani {x) = Op {n '^) 


' M, 


n (a:) F (^Zn—k:n) 


(gd) (V„ (t)))y " .o(i)(v„(i)) (n)) " dH (v) ^ 

"" ^ F(gd) (Vn(t))) ^ Jo WJ^) ^ 


where M+ (x) and M~ (x) denote, the maximum and the minimum of {xZn-k-.n) and 
[xZn-k-.n) respectively. Observe that both o gd) and H o gd) are regularly varying 
at inhnity with the same index equal to 1. Then by using assertion {Hi) in Proposition 6.1, 
we readily show that Ani {x) equals 


Op (n-^) ^ I (^)))' " W (^))' " dH {v) 


F {,^71—k-.n) JMn (x) 


H{Qm (i)) 


H(v) 


dt. 


Note that < H, then it follows, after integration, that 

^ , (U = l^(«) it 
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which, by a change of variables, becomes 


Anl (x) = 


Op {n-^) {Z^_k:n) c f) 


F {Zn-k-.n) 




hf (g(l) {Zn-k:n)t 


■^dt. 


From now on, a key resnlt related to the regnlar variation concept, namely Potter’s ineqnal- 
ities (see, e.g.. Proposition B.1.9, assertion 5 in de Haan and Ferreira, 2006), will be applied 
qnite freqnently. For this reason, we need to recall this very nsefnl tool here. Snppose that 'h 
is a regnlarly varying fnnction at infinity with index k, and let Ci, C2 be positive real nnmbers. 
Then there exists to = to (ci, C2) snch that for t > to, tx > to, 


(1 — Cl) x^'min < T (tx) /T (t) < (1 + ci) x'^’max (a;‘”^,a;®). (5.23) 


Since C and H o QP) are regnlarly varying at inhnity with respective indices p — 1 and 1, 
then we nse (5.23) to write that, for snfficiently small Cq > 0 and for all large n, we have 


(x^ 


Op (n-^) {Zn-k:n) F, (Zn-k-.n)') .M+(x)I^p-l±eo 


F(Zn-k-.n) H [Qi^) [H^^\Zn-k-.n) 




dt. 


From assertion (i) of Lemma 4.1 in Brahimi et al. (2015), we have pH (t) (t) —)■ 1, as 

t —>■ 00, hence (Zn-k-.n) /H (Zn-k-.n) ^ P- On the other hand, we have (t)) = t 

and (Zn-k.n) ^ 1, thus 


(Zn-k-.n) C (Zn-k-.n)) 


F(Zn-k-.n) H (QH {H^^\Zn-k-.n) 


= (l + oAl))(n/kr 


Then, after integration, we obtain 


Ani (x) = 


Op (n-^) 

(k/nf 


' —(1) , \ P-f?±co /—^ p-?j±eo 


H\’ (xZn-k-.n) 


TtH 


H^ ’ (Zn-k-.n) 


W ’ (xZn-k-.n) 
H^'^ (Zn-k-.n) 


We have — (Zn-k-n) ^ Pi therefore 
k 


A Ax)- 

^ ^ “ (k/n)^^^° 


(xZn-k-.n)y~"^^° - (xZn-k-.n)) 


, P-»?±eo 


By applying the mean value theorem, then assertion (i) in Proposition 6.1, we have 

4 uP)^.„v ^ uP)^^v A ^Y-v-i±eo 

■^nl (^) , , \^^n—k:n) H (xZ>yi—k\n) [H (xZ^i—k-.n) ) 

[k/n) ^ / 

and by assertion (ii) in Proposition 6.1, we get 

Op(n“2’’) /—(1) , _ ,xp-2,?±6o 
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Once again, by using the fact that (Zn-k n) ^ P together with routine manipulations 

k 

of Potter’s inequalities (5.23) , we end up with Ani {x) = Op ^{‘2v-p)/'y±<^o_ pet us 

now consider the term An 2 {x), which may be decomposed into the sum of 

d (n) - (n)) 

r^—^/'ypIn 


(a^) := 


F (2^7T,_/c:n) Jo 


H(v) 


-dt, 


and 


(a^) : = 


£ (v„ (t)) - C (t) d (n) - (n)) 


H(v) 


dt. 


^ (.^n—k'.n) Jo 

For the term A^^^ (^) > 'w® integrate the second integral by parts to get 

£ (V„ (t))-C (t) ( 77™ (Q'H (V„ (i))) - 77'”' (Q»> (V„ (i))) 


A 2 (x) = 




F i^n-km) 


77(Q«(V„(«))) 




7o (H{v)f 

Using assertion (ii) in Proposition 6.1 yields 

TC~^!Apf' 


dH (v) > dt. 


(6.24) 


dt 


F{Zn-k:n) {H {QW (t))) 


V ’ 


nH (^xZn — i.--n) (t') 

£ (v„ it)) 

dt 


C{t) 

(77(QW (()))’ 


APx) =Op(n-’>) 
which may be rewritten into 

Op ("-’) 

In the interval of endpoints [xZn-h.n) and x~^^'^p/n, we have, for large n, V„ (f) is 
uniformly close to zero (in probability). On the other hand, assertion (i) in Proposition 6.1, 
implies that (V„ (t) /f)^ is uniformly stochastically bounded. Then, making use of Potter’s 
inequalities (5.23) applied to the regularly varying function £ (■) (with index —q), we have 

C {Yn (t)) 


(1 - eo) (V, (t) /t) 


-tO-q 


< 


Lit) 


< (1 + ^o) (yn (t) /t) 


£o-g 


It is clear that 

£ (V„ (t)) 


£(t) 


< max {|(1 + eo) (V„ {t) /tY° - l| , |(1 - eo) (¥„ (t) /t) 


-eo-q 


1 |}- 


For the sake of simplicity, we rewrite this inequality into 

|£ (V„ it)) /£ (t) - 1| < 1(1 ± eo) (V. (t) - 1| 

which is in turn is less than of equal to [(¥„ (t) — l| + eo (¥„ (t) . In other 

words, we have 


|£(v„(«))/£W-i| < 


|(V„(()) 


±eo-q 


t±^0-q\ 


t±6o-<7 


+ ^0 (Yn (t) /t) 


±eo-q 
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By making use of assertion [ii) in Proposition 6.1 (once again), we show that 
\C {Yn (t)) /C{t)-1\< Op (n-^) + Op (1) _ 


Therefore 


A 


( 1 ) 

n2 


X 


< Op (n-2^) 


rH^'^^xZ^-k-.n) 


o 


p ("-") 


rH^^\xZ„_k:„) 


C (t) ^{v-lK±eo-q) 
F{Zn-k:n) (H (t)))" 
C {t) dt 

O' 


dt 


- x-^hpln F {Zn-k-.n) {H (QP) (f))) 

By a similar treatment as the above (we omit the details), we end up with ^4^2^ (^) = 
Op For the term ^4^2^ (x) , we start by noting that x <l/n, for 

any x > . Since Yn (t) = Ui^n, for 0 < f < 1/n, it follows that 


A 


( 2 ) 

n2 


X] = 


£(Oi,„)-£(t) 


dt 




Hiv) 


F (Zn-k-.n) 

For the second integral, we use analogous arguments based on assertion (ii) in Proposition 
6.1, to write 

Op (n-^) rx-^i-^v/n 


(X) = ^ 


(C(U,..n)+C(t))dt. 


F(Zn-k-.n) {H(Q^F (U,.,n))y Jo 

Note that the latter integral is equal to C (Ui,n) px~^l^ jnY fj F (t) dt. By Potter’s in¬ 
equalities (5.23) on C and the fact that riUi-.n A 1, it becomes (1 -|- Op (1)) pn~^C (l/’^-) . 

By replacing C by its expression, we get 


Y2 M = T= 


F(Q«(l/n)) 


n 


-71-1 


F(//-(1-Vn)) (7/(Q(i) (!/„)))’+' 
Now, we apply Potter’s inequalities (5.23) to F, to write 
F (Qli) (l/n)) _ ^ k/nF 


Op(i-‘0), 


F(H^(l-k/n)) 


Since H (w) > (w ), then (s) > H'^ (1 — s) and therefore 

H^(l-k/n) if^(l-fc/n) 

QP)(l/n) “ if'^(l —1/u)’ 

which, by (once again) using Potter’s inequalities (5.23) to (1 — s), equals O (k~'^'^'^°). 
Then the right-hand side of (5.25) is asymptotically equal to O . On the other 

hand, we have (iP (QP^ (l/n))y'^^ > (l/u))^ = it follows that ^4^2^ (x) 

Op (fc“^^^°) where p = 7/71 is assumed to be greater than 1/2. Consequently, we 

have ^41^2^3^) = Op (k~‘^^^^°) ^ for any 1/4 < p < p/2, and thus 74^2(3:) = 

Op (A;“^’^^'^°) as well. By similar arguments, we show that An 3 (x) asymptoti¬ 

cally equals the same quantity, therefore we omit the details. Finally, we may write that 


QP) (l/n) 


as u —)■ cx). 


(5.25) 














19 


^n 2 (^) = '^n 2 (x) + Op _ Pq^ ^he term ¥*3 (x) we proceed as we did for 

T *2 {x) but with more tedious manipulations. By two steps, we have to get rid of integrands 
dHn^ (v) and dHn^ (w) and replace them by their theoretical connterparts dH^^'> {v) and 
dH^^^ (w). First, we dehne, for 0 < t < 1 — 0, 

(t) ;= inf |u : (v) > and Q® (t) := inf |u : (v) > t 


By the change of variables v = Qn'^ (t) and similar arguments to those nsed for the terms 
Ani (x), we show that 

= i{w) I rH{v)-H„iv) 


t;3 (^) = 


'xZ„-k-.n F ^Zn-k-.n) 

Op ^(25?-p)/7±eo 


iv) 


(u) } dH^^^ (w) 


Second, we nse the change of variables w = Qn^ (s) and proceed as above to get ¥*3 (x) = 
Tns (x) + Op |;Pig stage, we proved that ¥„i (x) is exactly ¥*3 (x) and 

that ¥^2 {x) and ¥^3 (x) are approximated by ¥*2 (a:) and ¥*3 (x) respectively. For the hrst 
remainder term Rni (x), it suffices to follow the same procedure to show that, uniformly on 
X > p'^, y/kRni (a;) = Op fQp the last term i ?„2 (a;) , we nse similar 

technics to the decomposition 






Hn {V) 


H(v) 


,d (u) - (u)) 


H„ (v) 


r /p(„)-_g(„) 

H„(v)H(v) '• 


to show that \/kRni (x) = Op {k 2 p±eo^ 2 .( 2 p p)/ 7 ±eo g^g -well, therefore we omit details. Now, 
the weak approximation (5.21) of M„i (x) is well established. 

5.1.2. Gaussian approximation to Dn (x). We start by representing each ¥„j (x), j = 
1, 2, 3, in terms of the processes (5.18) and (5.19) . Note that ¥„i (x) may be rewritten into 

_ ... , -TfW 

T„,w=-r 1 

H(w) 

which, by integration by parts, becomes 


d (ffl'* (m) - i/'" («,)) 


Tnl(x) = 


F(m) i/i" (u>) - ff'*’(») 


F {Zn-k-.n) 


H(w) 


+ 


Hn M 


{w) dF {w) 


— k.:n 


xZt). 


Hiw) 


F {Zn-k-n) 


Observe that, for u > Zn-.n, H^n i'^) — 0 from Lemma 4.1 in Brahimi et al. (2015), 


(w) /H (w) tends to p as tc —)■ 00 . It follows that 


lim F (w) —" 


H(w) 


-= — hm F (w) -= 0. 


H{w) 
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Thus, after a change of variables, we have 

F {xZn-k-.n) H^n {xZn-k.n) “ (xZ^-k-.n) 

li„i hr = ^-=- 

F(Z^_,.,^) H(xZr,.k:n) 

^ r H^n {wZ^_k-,n) - {wZn-k:n) dF {wZn-k:n) 

Jx H {wZ^_i^-j^'j F i^ZYi—k-.n) 

Making the change of variables w = F^ (l — sF (^n-fc:n)) iZn-k-.n ='■ in (s) and using (5.14) 
and (5.18) yield 


T„,i (x) = -= 


^/fc F {xZn-k:n) f^n (3^) ^ 

n F {Z^-k:n) H {xZn-k-.n) ^ 




f3n {in (s)) 


^ (in (^) Zn—k:r 


-ds. 


Next, we show that, for eo > 0 sufficiently small, we have 

VkTni (x) = (x) + - j (w) dw + Op (a;‘^^^“P^/'>'=‘=^°) . (5.26) 

1 J X 


To this end, let us decompose T„i (x) into the sum of 


TiV M :=—^ 


\/k F Pn (^) 




. tS (a:) := 


n F{Zn-k-.n) H{xZn-k:n) 


/— \ n — k-.nj /-I / 

yk I ^{^n — k-.n) Pn (^)) 


t2 > (U := - 

n 




^ Jx H (in (s) Z^—k-.n') 

t^n (in (s)) 


ds, 


H (in ("S) Z^i—k-.n) 


1 ^ 


H{s-'^^h) 


ds, 


ThW:=- 

n 


C-l/71 


l^n{in{s)) - fdnjs 
H{s-^^h) 


ds. 


tS (U - — 


-1/71 


n 


(h) 

H{s-'r^h) 


FjCFds, tS (X) := Fi 

s'^FiH (h) ’ ^ ^ n ^ 


i;-l/7i 


^FP-ds. 


'o s'^FiH (h) 


We shall show that VkT^jl (x) = Op p)/7±eo) ^ ^ _ 2^ •••,5, uniformly on x > while 
(x) and (x) are approximations to the hrst and second terms in (5.26) respec¬ 

tively. Let ns begin by y/kT^^l (x) and write 

k r'*' l/3n «»(*))! 


Vk tS {x) 


< - 


-ds. 


nJo H (in (s) Zn-k:n) 
where c„ (x) := max (F (xZn-km) /F (Zn-k-n) ,x~^/p . Next, we provide a lower bound to 
H (in (s) Zn-k:n) by applying Potter’s inequalities (5.23) to F. Since Zn-k-.n —>■ C)0 a.s., then 
from the right-hand side of (5.23) we have F (xZn-k-.n) /F (Zn-k-.n) < 2 x~^FFko which 
implies that c„ (x) < 2 x~^Fk^ko ^ rewrite the sequence 

in (s) into 

(^ - .FF(Z.. 

(5.27) 


F*- (l - sF (Zn-k-n)) 

in (s) = 0 < S < 1, 


(1 - F (Zn-k.n)) 

and use the left-hand side of Potter’s inequalities (5.23) for the quantile function u —>■ 
F*- (l-u), to get in (s) > 2“^s"^i=^"o a.s., for any x~^Fi < g < F (xZn-k.n) /F (Zn-k.n) ■ 




















This implies that (s) > > 0. Then (s) Zn-k-.n —t oo a.s. and there¬ 

fore, from the left-hand side of (5.23) (applied to TT), we have H {Zn-k-.n) IH (^„ (s) Zn-k-.n) = 
^ (^{^n (5))^^"’’^'^°^ a.s. uniformly on s. This allows us to write that 

(a^) = O (1) — ^ / (^n \l3n {in {s))\ds, a.S. 

n [Zn-k-.n) Jo 

By combining Corollary 2.2.2 with Proposition B.1.10 in de Haan and Ferreira (2006), we 
have {n/k) H {Zn-k-.n) ^ 1, hence 

^2a;-l/T'l±eo 

V^T.® (X) = Op (1) / (J„ (S))*'®'" \X ((n (S))! *. 

Jo 

From (5.23) , we infer that 0 < s < 2xq =; sq. On the other hand, in view of assertion 

{ii) of Lemma 6.2, we have supo<s<so (s))*'^”^^^"’’ \IJn {in (s))| = Op (1), therefore 

^2a;-l/T'l±Eo 

^T®(i)=Op(l) / 

Jo 

Note that (1 — r/) /y ± cq > 0, then by using the right-hand side of (5.23) (applied to 
(1 - •)), we get 

^2x-i/n±so 

y/kT^^l (x) = Op (1) / 

Jo 

which equals Op . Recall that 71 = 7/p, then it is easy to verify that {x) = 

Op (0:^^'’“^)/'’'^'^°) . By using similar arguments, we also show that (x) = Op , 

i = 3, 5, therefore we omit the details. For the term T„4 {x ), we have 


Vk 


tS’ (2^) 


k 

< - 
n 


C-l/71 


|/^n(Cn(g)) - (Jn{s | 
H{s-^^h) 


ds. 


In view of (5.23) with the fact that H {h) = k/n, we have 


(X) = Op (1) / \Pn {in {S)) - ^n | ds. 

Jo 

From assertion {ii) of Lemma 6.2, we have jdn {in (s) — 5“'*'^) = Op , uniformly 

on 0 < s < then after elementary calculation, we end up with y/kT')^l (x) = 

Op _ As for the first term Tni (^)) suffices to use Potter’s inequalities (5.23) 

(for F and H) to get 


(x) = x-^/'^^(3n {x) + Op (a;(2'?-p)/7±^o) _ 

Finally, for (x) we observe that (x) = (s”"^^) ds, which by a change 

of variables meets the second term in (5.26) . Let us now consider the term T„2 {x ). First, 
notice that 

d (vf (v) - 7“' (i.)) 


» d (rf* (ti) - i/‘“> (!>)) 
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which, after an integration by parts, becomes 


(//f (0) - (0)) 


Hn (w)_- H (w) 

H{w) 




dH (v) 


H (v) 


It follows that T„2 {x) may be written into the sum of 

TSW:=-(S'r (0) - 3?'“’(0)) 


tS (a:) := 


and 


tS (x) := 


— k: 7 i 


F{w) 


r°° F (w) (w) 

F {Zn-k:n) H (w) 

(w) - (w) F (w) dH^^^ (w) 

FH F{Zn-k:n) H (w) ’ 


'xZn-k-.n^ {Zn-k-.n) [ Jo 
J-YJW 


fjjW / N -TfW / 

[v) -H (v)J—. 


dH (n) I dH^^^ (w) 


H (v) J H{w) 


For the hrst term, we replace dH (w) and H (w) by G (w) dF (w) and G (w) F (w) respec¬ 
tively and we get 

tS (^) = (731“’ (0) - 37'“’ (0)) 

By using the routine manipulations of Potter’s inequalities (5.23) (applied to F), we obtain 
’^n2 (^) = (^-^n ^ (0) ~ (0)) Op _ On the other hand, by the central limit 

theorem, we have H^^^ (0) — iif (0) = Op , as n —)■ cxd, it follows that VkT ^2 (^) = 

^Jk/nOp _ Since = q (^^{‘iri-p)h±eQ'^ then y/kT^^l [x) = 

Op _ It is easy to verify that (^) may be rewritten into 

y/kJ^F ^ r _Pn{w) dF{wZr,-k-,n) _ 

A H{wZn-k-,n) F{Zn-k-,n) 

By using similar arguments as those used for T„i (x), we show that 

T) ~ 

'/kT^^2 (^) =-/ (lyj _|_ Qp ^^{2)?-p)/7±e0^ ^ 

F J X 

therefore we omit the details. As for the third term ¥[^2^ (x), we have 

pwZn — k-.n 


tS (a:) = 


33.'“’ (tO - 33'“’ („: 


dH {v) [ ^F {wZn-k-.n) 


(n) J ” F {Zn-k-.n) 


After a change of variables and an integration by parts in the hrst integral, we apply Potter’s 
inequalities (5.23) to F, to show that 

ri / — I r^^n — k-.n 

•\/fcT^2 (^) =-/ 

'y J X 

where 

poo I 

VkRns (x) := Op ^ j ^/}^ 


y/k (h^^ (n) - (n)') dy;+v^_R^3 (x), 

^ ^ H (n) J 


H„ iv) - H n 


dH {v) 
H^ iv) 


dw. 
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By using once again assertion (n) in Proposition 6.1 with the fact that < H together 
with Potter’s inequalities (5.23) routine manipulations, we readily show that y/kRns (x) = 
Op , uniformly on a: > p'^, therefore we omit the details. Recall that 71 = 7/p 

and let us rewrite the hrst term in ¥^^2 (^) 







dH (n) 

h\v) 




which, by an integration by parts, becomes 




w y/k 


(id® {wZ^_k:n) - {wZr,.k-,n)) 


dH (wZji_i; ji^ 
H {w ZYi—k'.n^ 


k 


/-ft2 


The hrst term above may be rewritten into-/ jdniw) dH {wZn-k-.n) /H (wZn-k-.n) and 


n 


by similar arguments as those used for T„i [x) , we show that the second term equals 

- {w) dw + Op (2:(2 »?-p)/7±co^ 


{x) = -- [ (5n {w) +- [ (w) dw + Op P)/7±coj ^ 

^Jo H {wZn-k:n) 


Consequently, we have 


y/kTn2 [x] = —X — 


H {wZn-k:n) ^ 

For the term ¥^3 [x) , routine manipulations lead to 


( 3 n {w) ^ f [yj] dw+Op p)/7±eo^ _ 


\/fc ¥„3 (x) = X 


— ^,-1/71 


k 


n 


f(0) 


« F...^ , « F...^ \ {wZ^_k-,n) 

f^n (^) “ 1 “ Hn (^) ) _ 2 

H {wZ,^_}^.^ 


1 

7 


w 


-1/72—1 


[jdn {w) + jdn (w) j dw + yjkR^A (x), 


where 


y/kRnA (x) := Op (1) x 


n 


\f 3 n (tn)| + f 3 n {w] 


dH^^'^ {wZn-k:n 
H {wZn—k-.n) 


+ - f W ^ (|/?„ (w)| + Pn {w) j 


dw. 


By a similar treatment as that of y/kRn 3 (x ), we get y/kRnA (x) = Op p)/7±eo^ _ gy 
substituting the results obtained above, for the terms Tnj (x), j = 1, 2,3, in equation (5.21) , 
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we end up with 


y/kM^i{x) = + X I / (ly) —, 


{wZn k:n) 


n 


^ ~ {wZn-k:n) 

Pn {W) ^2 - 

H {w ZYi—k\n) 


H (w 

+ Op . 


The asymptotic negligibility (in probability) of \/k'Mn 2 (x) is readily obtained. Indeed, 
note that we have \/fcM„i [x) = Op p)/7±eo^ from Theorem 2 in Csorgd (1996), 
we infer that F {Zn-k-n) / Fn{Zn-k:n) — 1 = Op _ This means that \/~kM.n 2 {x) = 

Op (because fc —)■ oo). Recall now that 

VkMnS (x) = (1), 

-O \y^ri—k\n) 

which, by applying Potter’s inequalities (5.23) to F, yields that y/kWlns (x) is equal to 

k 


(x) — X 


-1/71 _ 


n 


^ . . dH^^> {wZ^_k..n) 

Pn [W) ^2- 

H {w Z^i—k-.n) 


dH^^^ {wZr,.k:n) 

Pn (tt') ^2- 

H {w Z-yi—k-.n^ 


+ Op (x(27-p)/7±^0) _ 


Therefore 
3 


VkM^, (x) = xF^^/3n (x) - x-F'r^/dn (1) + (/3„, /3„; x) + Op (a;(2’?-p)/7±^o) ^ 

i=l 


where 




i,f3n;x] := X 


— ^-1/71; 


n 


^ ^ . .dH^''\wZn-k-,n) f 

Pn [W) ^2-/ 

H {w Z^i—k-.n) 


. dH^^^ {wZn-k:n) 
Pn {W) ^2- 

H {wZ^_]^-j^ 


which, by routine manipulations as the above, is shown to be equal to 


X 


-1/71 J .,-1 


Pi ^ 1 / y;l/7 (^y^^ dtC > + Op (x^^^ P)/7±e0^ _ 


Therefore, we have 


^ \fkMLni (x) = X^Oa^^ _|_ Qp p)/7±eo^ 


2 = 1 


- X J (^1) - 7^ W (y;) c^y; _ 1 / y;l/7 (y;) g^y; 


We are now in position to apply the well-known Gaussian approximation (5.16) to get 

3 


VkMni (x) = xOt' 2 (x) - X (1) 

2=1 


+ x-^Oi^ r^^^dH\w) 
h H (w) 


/•'B^ 
/o F^w) 


dH {w) I + Op (a:(2^-P)0±^o) , 
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where B„ (w) and B* (w) are two Gaussian processes defined by 
Bn (w) := Bn (6) -Bn{e- if {wZn-k:n)) and B„ (w) := -Bn (l - ff {wZn-k:n)) • 


By similar arguments as those used in Lemma 5.2 in Brahimi et al. (2015), we end up with 

3 

'^y/kMni {x) 

i=l 


= 






where (s) and (s), 0 < s < 1, are sequences of centred Gaussian processes defined 
by B„ (s) := Bn {0) — Bn{9 — ps) and B„ (s) := —Bn (1 — qs). Let {Wn (t); 0 < t < 1} be a 
sequence of Weiner processes defined on (G, A, P) so that 


{Bn {t)]0<t<l} = {Wn {t) -tWn{l)]0<t<l}. (5.28) 


It is easy to verify that J2i=i (x) = Jn (x) + Op , which is exactly (5.20) . 

Finally, we take care of the term M„4 (x). To this end, we apply the uniform inequality of 
second-order regularly varying functions to F (see, e.g., the bottom of page 161 in de Haan 
and Ferreira (2006)), to write 


and 


= (1 + Op (D) 4, 

F [Zn-k-.n) 'Phi 

^ ^-1/71 = (1 + o (1)) yl, (/,) x-F'r^ ^ 


F{h) V ■ “ \ \ 

uniformly on x > p'^. Since Ai is regularly varying at infinity (with index ryi) and Zn-k-.n/h h 
1, it follows that Ai (Zn-k-.n) = (1 + Op (1)) Al {h ),, therefore 

X^/7i _ 1 


V^M„4 (x) = (1 -h Op (1)) X : 


-p/ll 


-hkAi (h), as n —)■ oo. 


By assumption we have hkAi (h) = O (1), then 

F / 'Tl T 

Dn (a:) - Jn (x) - X-F'r h " = U{ 2 ^-p)h±eo\ _ 

r/7i 

Let po > 0 such that 1/4 < p < pq < p/2, then 

C FI 'Tl T 

^{p-2m)h 1(a;) _ (a;) _ x-F'^^h_^hkAi (h)| = Op [x^(p-Po)/'y±^o^ . 

Now, we choose eo sufficiently small so that {p — Po) /l + < 0. Since x > > 0, then 

Op (a;^G-w)/7±co^ = Qp ^ hence for 1/4 < po < p/4: 

•'r/71 _ 1 ^ 

—- -VkAi 

x/li 



^(p-2vo)h JJ^ ^x) - Jn (X) 
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uniformly on a; > To achieve the proof it suffices to replace 7 by p 7 i and choose e = p—2riQ 
so that 0 < e < 1 / 2 , as sought. □ 

5.2. Proof of Theorem 3.1. For the consistency of 71 , we make an integration by parts 
and a change of variables in equation (3.9) to get 

1 Fn ^n—k-.n) 


7i = 


X 


-dx, 


F'n i^^n—k-.n) 

which may be decomposed into the sum of Ji„ := x~^F (xZn-k-.n) /F (Zn-k-.n) dx and 
^2n ■ = r X ^ J2i=i^ni (^) dx. By using the regular variation of F (1.1) and the corre- 

p 

spending Potter’s inequalities (5.23) , we get /i„ — >■ 71 as n —)■ cxd. Then, we just need to 
show that l 2 n tends to zero in probability. From (5.20) we have 


-^2n — —^ 


\/k 


°° o ni 

X~^Jn{x)dx+ ^ 


y/k 




where the second integral above is hnite and therefore the second term of l 2 n is negligible 
in probability. On the other hand, we have x~^Jn (x) dx = r X ^ (Jin (x) + J2n (a^)) dx, 
where Jin (x) and J 2 n (x) are the two centred Gaussian processes given in Theorem 2.1. After 
some elementary but tedious manipulations of integral calculus, we obtain 


X ^Jn(x)dx 




(5.29) 


T) + (1 _w„,, ( 

n J \ pJ \n 




n 


Since {Wn (s), 0<s<l}isa sequence Weiner processes, we may readily show that 


E 


W 1 

’ ’ n,l 


k 


n 


< (ps 


d/2 


and 


E 


W 


n,2 


k 


n 


< (qs) 


1/2 


But 7 i < 72 , hence 0 < g < 1/2 and thus it is easy to verify that E | x~^Jn (x) dx\ < 00 . 
This yields that l 2 n ^ 0 when n —)■ 00 (because l/\/k —)■ 0), as sought. As for the Gaussian 
representation result, we write y/k (71 — 71 ) = x~^Dn (x) dx, then by applying Theorem 
2.1 together with the representation (5.29) and the assumption, we have \/kAi (h) —?• A, we 
get Vk (71 - 7 i) = Cin + C 2 n + Csn + q- 1 + Op ( 1 ), where 


1 — r 


Cin := 7 ^/^ 


-9-iW, 


n,2 


k 


n 


-s ds, Gsn := 7 1- 


q 


p 


- s 


-i-iW, 


n,l 


-s 1 ds 
n 


k 


n 


and Gsn := — 7 iW y^n,! ( ~ • The computation of the limit of E [Cin + C 2 n + C^n\ gives 


2g7^ 


2 g 2 - 3g + 1 


+ 1 -- 


2^7^ 


p J 2g2 _ 3g -I- 1 


+ 2!_22!fi-?| = 


p 


p 


7 


p J p (2p — 1) ’ 


which by substituting p 7 i for 7 completes the proof. 

Concluding notes 


□ 
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On the basis of Nelson-Aalen nonparametric estimator, we introduced a product-limit pro¬ 
cess for the tail of a heavy-tailed distribution of randomly right-censored data. The Gaussian 
approximation of this process proved to be a very useful tool in achieving the asymptotic 
normality of the estimators of tail indices and related statistics. Furthermore, we defined 
a Hill-type estimator for the extreme value index and determined its limiting Gaussian dis¬ 
tribution. Intensive simulations show that the latter outperforms the already existing ones, 
with respect to bias and MSE. It is noteworthy that the asymptotic behavior of the newly 
proposed estimator is only assessed under the second-order condition of regular variation 
of the underlying distribution tail, in contrast to the unfamiliar assumptions of Worms and 
Worms (2014) and Einmahl et al. (2008). This represents the main big improvement brought 
in this paper. Our approach will have fruitful consequences and open interesting paths in the 
statistical analysis of extremes with incomplete data. The generalization of this approach 
to the whole range of maximum domains of attraction would make a very good topic for a 
future work. 


6. Appendix 

The following Proposition provides useful results with regards to the uniform empirical and 
quantile functions (t) and (t) respectively. 

Proposition 6.1. (i) For n > 1 and 0 < a < 1, we have 

sup {t/Vn = Op (1) = sup {t/Yn . 

a/n<t<l a/n<t<l 

(a) For n>l, 0<6<1 and 0 < r; < 1/2, we have 

sup lUn {t) — t\ /t^~^ = Op (1) = sup |V„ (t) — t\ /t^~^. 

0<t<l b/n<t<l 

{ill) Let if be a regularly varying function at infinity with index a and let 0 < On < bn < 1 
be such that nOn = O (1) and bn i 0. Then 

sup ((p(t)/(p(V„(t))) = Op(l). 

<t<bn 

Proof. The proofs of assertion (i) and the hrst result of assertion {ii) may be found in Shorack 
and Wellner (1986) in pages 415 (assertions 5-8 ) and 425 (assertion 16) respectively. The 
second result of assertion {ii) is proved by using the first results of both assertions (i) and 
{ii). For the third assertion {Hi) , it suffices to apply Potter’s inequalities (5.23) to function 
(p together with assertion {i) corresponding to Yn {t) ■ □ 

Lemma 6.1. Let In (a;) be the interval of endpoints {xZn-k-.n) o.nd {xZn-k-n) ■ Then 
sup sup (£ (t)/£ (Vn (t))) = Op (1), as n —)■ cx), 

X>p~< t&In{x) 
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where C (s) ;= F (s)) /H (s)) , with being the generalized inverse of H 


T7(i) 


Proof. First, we show that, for any small 0 < .^ < 1, there exists a constant c > 0, snch that 
In (x) is inclnded in the half-open interval [(1 — fn, ckin) , with probability close 

to 1, as n —>■ oo. Indeed, following similar argnments as those nsed in the proof of part (i) 
in Lemma 4.1 in Brahimi et al. (2015), we infer that for all large n 


{xZn-k-.n) = (1 + Op {l))pX-^/'^H {Zn-k:n) , 
nniformly on x > . We have H [Zn-k-.n) = (1 + Op (1)) k/n, which implies that 

P I (1 — px~^^^— < (xZn-k-n) < (1 + 0 px~^^'^ — \ — )■ 1, as n —)■ CX). 

n n 


(6.30) 


Since k > 1 and x > p'^, then 

(l-Opa;-^/^ (1) 


k 


n 


< H {xZn-k:n) < (1 + 0 “ t as n -)■ CX). 


n 


(6.31) 


On the other hand, it is obvions that for any x > p''', P {^^n-k-.n) > l/'oj = 1 
and (xZn-k-.n) = U„ [xZn-k-.n)^ , a-s. Then, in view of assertion [i) in Propo¬ 
sition (6.1) , we have (xZn-k-.n) = Op {xZn-k:n)j , it follows, from (6.30) , that 
{xZn-k-.n) = Op {k/n) , nniformly on x > p'*'. This means that there exists d > 0 snch 

that 


P i [xZn-k:n) < dk/u ^ 1, aS 77, —)■ CX). 


n 


(6.32) 


Therefore, from (6.31) and (6.32), that P {/„ (x) C [(1 — .^) px“^/'>'/n, cfc/n)} tends to 1 
as n —)■ CX), for any x > p'*', where c := max (d, 1-|-^), as songht. Next, let (x) := 
(1 — f) px~^0 jfi and bn ■= ck/n 0. It is clear that 0 < a„ (x) < < 1 and nOn (x) < 1 — 

that is nOn {x) = O (1) for any x > p'*'. Then, by nsing assertion [Hi) of Proposition 6.1, we 
get snpjgj^( 2 ,) (£ (t) /C (V„ (t))) = Op (1), as n —>■ cx), nniformly on x > p'^. □ 

Lemma 6.2. Let (3n o^d be the two empirical processes respectively defined in (5.18) and 
(5.19) . Then, for all large n and any 1/4 < p < p/2, we have 


(i) 1^^ (y;)| = Op (1) = 


W>1 


W>1 


fdn {w) 


Moreover, for any small cq > 0, we have uniformly on 0 < s < x 


(ii) Pn {in (S)) - Pn (*-’■) = Op , 


where ^n (s) is of the form (5.27) . 
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Proof. Recall that 


/3n (w) = ^ jon (6') -an(^0 - {wZ^-k-.n)) } ) for 0 < iwZn-k-.n) < 6, 

D 

and note that {«„ {9) — an {0 — t), 0 < t < 9} = {an (t), 0 < t < 9} . Then withont loss of 
generality, we may write (3n (w) = (wZn-k-.n)^ , for any w > 1. It is easy to 

verify that 

{wZn-k:n)) " {wZn-k:n) 


|/9nHI = 


( H^"\wZn-k:n)^" ' 


y/k 


w 


-1/7 


{wZn-k-.n)) 


ri-1 


Making nse of Proposition 6.1 (assertion {ii)), we infer that 


n' 


(wZn-k-.n)) - {wZn-k:n) 




iwZn-k:n)) 

nniformly on w > 1, for any 1/4 < r] < p/2. It follows that 

nl-n (H^^\wZn-k:n) 


= Op(l), 


W^^-'^^l^\l3n{w)\=0, 


w 


-1/7 


^ V 

On the other hand, by using Potter’s inequalities (5.23) to both F and G, with similar 
arguments as those used in the proof of Lemma 4.1 (assertion (*)) in Brahimi et al. (2015), we 
may readily show that lim;j^oo sup^^^ {zw) /H {z) — pw~^l^ = 0. Since —H {Zn-k-.n) ^ 

K 

1 , then |/3„ (ru)| = Op ((fc/n)'’uniformly on ty > 1. We have k —)■ 00 , 

k/n —)■ 0 and 1/2 — 2?] < 0, for any 1/4 < rj < p/2, therefore tyP-^lO (ty)| = Op (1), 
uniformly on ty > 1, leading to the hrst result of assertion [i). The proof of the second result 
follows similar arguments. For assertion (ii ), we first note that 

[f^n {in (s)) - Pn ,0 < S < l} = {/3„ (s) - ,0 < S < l} . 

By using assertion {i ), we write (3n {in (s) — = Op (s) — s-t'i ^ uniformly 

on 0 < s < and by applying (once again) Potter’s inequalities (5.23) to (! — •), 

we infer that in (s) — = Op . It follows that 

/Sn {in {S) - S-^-) = Op , 


which completes the proof. 


□ 
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